Reliability Modeling of Large Fault-Tolerant Systems

Authors

  • Neeraj Suri
  • M. M. Hugue
  • Chris J. Walter
Abstract

A cluster-based ultra-reliable architecture is presented, offering synchronization and system functionality comparable to that of fully connected systems, with reduced system overheads. Existing combinatorial and Markov models do not sufficiently model concurrently occurring faults in such large systems. A reliability model considering the distribution of concurrent faults across the system clusters is shown to increase the accuracy of reliability and system fault tolerance estimates. The hybrid fault model, which classifies faults based on their behavior, further improves reliability estimates and enhances the fault handling capability of each cluster. Linear growth in cluster reliability with respect to cluster size is possible, as are refinements in the convergence and consistency algorithms for synchronization.
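To illustrate why the distribution of concurrent faults across clusters matters, here is a minimal sketch comparing a pooled combinatorial estimate with a per-cluster one. It is not the paper's model. The independent per-node fault probability `p`, the tolerance of `(n - 1) // 3` arbitrary faults per cluster (the classical Byzantine bound), the independence of clusters, and all function names are assumptions made only for this illustration.

```python
from math import comb


def prob_at_most(n, f, p):
    """P(at most f of n nodes are faulty), nodes independent, each faulty with probability p."""
    return sum(comb(n, k) * p**k * (1 - p) ** (n - k) for k in range(f + 1))


def cluster_reliability(n, p):
    # Assumption: a cluster of n nodes masks up to floor((n - 1) / 3) arbitrary faults.
    f = (n - 1) // 3
    return prob_at_most(n, f, p)


def system_reliability(clusters, n, p):
    # Assumption: the system survives only if every cluster survives,
    # and clusters fail independently of one another.
    return cluster_reliability(n, p) ** clusters


def pooled_estimate(clusters, n, p):
    # Pooled view: counts total faults only and ignores which cluster they land in.
    f = (n - 1) // 3
    return prob_at_most(clusters * n, clusters * f, p)


if __name__ == "__main__":
    c, n, p = 4, 4, 0.01
    print("per-cluster estimate:", system_reliability(c, n, p))
    print("pooled estimate:     ", pooled_estimate(c, n, p))
```

Under these assumptions the pooled figure is never lower than the per-cluster one, because several faults landing in a single cluster can defeat that cluster even when the system-wide fault count looks tolerable.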

Similar resources

Mathematical modeling and fuzzy availability analysis for serial processes in the crystallization system of a sugar plant

The binary states, i.e., success or failed state assumptions used in conventional reliability are inappropriate for reliability analysis of complex industrial systems due to lack of sufficient probabilistic information. For large complex systems, the uncertainty of each individual parameter enhances the uncertainty of the system reliability. In this paper, the concept of fuzzy reliability...

Techniques for Modeling the Reliability of Fault-Tolerant Systems With the Markov State-Space Approach

This paper presents a step-by-step tutorial of the methods and the tools that were used for the reliability analysis of fault-tolerant systems. The approach of this paper is the Markov (or semi-Markov) state-space method. The paper is intended for design engineers with a basic understanding of computer architecture and fault tolerance, but little knowledge of reliability modeling. The represent...

Coverage-based testing strategies and reliability modeling for fault-tolerant software systems

Software permeates our modern society, and its complexity and criticality are ever increasing. Thus the need for the capability to tolerate software faults, particularly for critical applications, is evident. While fault-tolerant software is seen as a necessity, it also remains a controversial technique, and there is a lack of conclusive assessment about its effectiveness. This thesis aims at providing a q...

Reliability Growth of Fault-Tolerant Software

Two fault-tolerant software techniques are investigated: recovery block and N-version programming. For each, the stable reliability model is transformed into a model that considers reliability growth via the transformation approach based on the hyperexponential model. Analytic and numeric processing of the transformed models identify the influence of fault removal on the reliability of the faul...

Proceedings of the 2005 International Conference on Simulation and Modeling

Reliability enhancement in software systems is a crucial and challenging issue. Applying an efficient fault-tolerant mechanism can fulfill the system reliability requirement. This paper proposes reliability models for hierarchical and hybrid fault-tolerant software systems considering failure dependencies or related faults in software components/versions. Our system models are based on the classica...

Reliability and Performance Evaluation of Fault-aware Routing Methods for Network-on-Chip Architectures (RESEARCH NOTE)

Nowadays, faults and failures are increasing especially in complex systems such as Network-on-Chip (NoC) based Systems-on-a-Chip due to the increasing susceptibility and decreasing feature sizes. On the other hand, fault-tolerant routing algorithms have an evident effect on tolerating permanent faults and improving the reliability of a Network-on-Chip based system. This paper presents reliabili...

Publication date: 1992